SEQUENCE LISTING

4 (1) GENERAL INFORMATION:

6 (i) APPLICANT: Jacobs, Kenneth

7 McCoy, John M.

8 LaVallie, Edward R.

9 Collins-Racie, Lisa A.

10 Evans, Cheryl

11 Merberg, David

12 Treacy, Maurice

13 Spaulding, Vikki

15 (ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES

16 ENCODING THEM

18 (iii) NUMBER OF SEQUENCES: 219

20 (iv) CORRESPONDENCE ADDRESS:

21 (A) ADDRESSEE: Genetics Institute, Inc.

22 (B) STREET: 87 CambridgePark Drive

23 (C) CITY: Cambridge

24 (D) STATE: MA

25 (E) COUNTRY: U.S.A.

26 (F) ZIP: 02140

28 (v) COMPUTER READABLE FORM:

29 (A) MEDIUM TYPE: Floppy disk

30 (B) COMPUTER: IBM PC compatible

31 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

32 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30

34 (vi) CURRENT APPLICATION DATA:

35 (A) APPLICATION NUMBER: US/09/745,763

36 (B) FILING DATE: 18-Jun-2000

37 (C) CLASSIFICATION:

39 (viii) ATTORNEY/AGENT INFORMATION:

40 (A) NAME: Sprunger, Suzanne A.

41 (B) REGISTRATION NUMBER: 41,323

43 (ix) TELECOMMUNICATION INFORMATION:

44 (A) TELEPHONE: (617) 498-8284

45 (B) TELEFAX: (617) 876-5851


48 (2) INFORMATION FOR SEQ ID NO: 1:

50 (i) SEQUENCE CHARACTERISTICS:

51 (A) LENGTH: 1800 base pairs

52 (B) TYPE: nucleic acid

53 (C) STRANDEDNESS: double

54 (D) TOPOLOGY: linear

55 (ii) MOLECULE TYPE: cDNA

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



62 TTTTTTTTTT TACAGACTTC ACAGAGAATG CAGTTGTCTT GACTTCAGGT CTGTCTGTTC 60 

64 TGTTGGCAAG TAAATGCAGT ACTGTTCTGA TCCCGCTGCT ATTAGAATGC ATTGTGAAAC 120 

66 GACTGGAGTA TGATTAAAAG TTGTGTTCCC CAATGCTTGG AGTAGTGATT GTTGAAGGAA 180

68 AAAATCCAGC TGAGTGATAA AGGCTGAGTG TTGAGGAAAT TTCTGCAGTT TTAAGCAGTC 240

70 GTATTTGTGA TTGAAGCTGA GTACATTTTG CTGGTGTATT TTTAGGTAAA ATGCTTTTTG 300

72 TTCATTTCTG GTGGTGGGAG GGGACTGAAG CCTTTAGTCT TTTCCAGATG CAACCTTAAA 360 

74 ATCAGTGACA AGAAACATTC CAAACAAGCA ACAGTCTTCA AGAAATTAAA CTGGCAAGTG 420

76 GAAATGTTTA AACAGTTCAG TGATCTTTAG TGCATTGTTT ATGTGTGGGT TTCTCTCTCC 480

78 CCTCCCTTGG TCTTAATTCT TACATGCAGG AACACTCAGC AGACACACGT ATGCGAAGGG 540 

80 CCAGAGAAGC CAGACCCAGT AAGAAAAAAT AGCCTATTTA CTTTAAATAA ACCAAACATT 600 

82 CCATTTTAAA TGTGGGGATT GGGAACCACT AGTTCTTTCA GATGGTATTC TTCAGACTAT 660 

84 AGAAGGAGCT TCCAGTTGAA TTCACCAGTG GACAAAATGA GGAAAACAGG TGAACAAGCT 720 

86 TTTTCTGTAT TTACATACAA AGTCAGATCA GTTATGGGAC AATAGTATTG AATAGATTTC 780 

88 AGCTTTATGC TGGAGTAACT GGCATGTGAG CAAACTGTGT TGGCGTGGGG GTGGAGGGGT 840 

90 GAGGTGGGCG CTAAGCTTTT TTTAAGATTT TTCAGGTACC CTTCACTAAA GGCACCGAAG 900 

92 GCTTAAAGTA GGACAACCAT GGAGCTTCCT GTGGCAGGAG AGACAACAAA GCGCTATTAT 960

94 CCTAAGGTCA AGAGAAGTGT CAGCCTCACC TGATTTTTAT TAGTAATGAG GACTTGCCTC 1020 

96 AACTCCCTCT TTCTGGAGTG AAGCATCCGA AGGAATGCTT GAAGTACCCC TGGGCTTCTC 1080 

98 TTAACATTTA AGCAAGCTGT TTTTATAGCA GCTCTTAATA ATAAAGCCCA AATCTCAAGC 1140

100 GGTGCTTGAA GGGGAGGGAA AGGGGGAAAG CGGGCAACCA CTTTTCCCTA GCTTTTCCAG 1200 

102 AAGCCTGTTA AAAGCAAGGT CTCCCCACAA GCAACTTCTC TGCCACATCG CCACCCCGTG 1260

104 CCTTTTGATC TAGCACAGAC CCTTCACCCC TCACCTCGAT GCAGCCAGTA GCTTGGATCC 1320

106 TTGTGGGCAT GATCCATAAT CGGTTTCAAG GTAACGATGG TGTCGAGKTC TTTGGTGGGT 1380 

108 TGAACTATGT TAGAAAAGGC CATTAATTTG CCTGCAAATT GTTAACAGAA GGGTATTAAA 1440

110 ACCACAGCTA AGTAGCTCTA TTATAATACT TATCCAGTGA CTAAAACCAA CTTAAACCAG 1500 

112 TAAGTGGAGA AATAACATGT TCAAGAACTG TAATGCTGGG TGGGAACATG TAACTTGTAG 1560 

114 ACTGGAGAAG ATAGGCATTT GAGTGGCTGA GAGGGCTTTT GGGTGGGAAT GCAAAAATTC 1620

116 TCTGCTAAGA CTTTTTCAGG TGAACATAAC AGACTTGGCC AAGCTAGCAT CTTAGCGGAA 1680 

118 GCTGATCTCC AATGCTCTTC AGTAGGGTCA TGAAGGTTTT TCTTTTCCTG AGAAAACAAC 1740

120 ACGTATTGTT TTCTCAGGTT TTGCTTTTTG GCCTTTTTCT AGCTTAAAAA AAAAAAAAAA 1800
124 (2) INFORMATION FOR SEQ ID NO: 2: 

126 (i) SEQUENCE CHARACTERISTICS: 

127 (A) LENGTH: 48 amino acids

128 (B) TYPE: amino acid 

129 (C) STRANDEDNESS: 

130 (D) TOPOLOGY: linear 
132 (ii) MOLECULE TYPE: protein 

135 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

137 Val Trp Val Ser Leu Ser Pro Pro Leu Val Leu Ile Leu Thr Cys Arg

138 1 5 10 15

140 Asn Thr Gln Gln Thr His Val Cys Glu Gly Pro Glu Lys Pro Asp Pro

141 20 25 30

143 Val Arg Lys Asn Ser Leu Phe Thr Leu Asn Lys Pro Asn Ile Pro Phe

144 35 40 45
148 (2) INFORMATION FOR SEQ ID NO: 3:

150 (i) SEQUENCE CHARACTERISTICS: 

151 (A) LENGTH: 1063 base pairs 

152 (B) TYPE: nucleic acid 

153 (C) STRANDEDNESS: double 

154 (D) TOPOLOGY: linear 
156 (ii) MOLECULE TYPE: cDNA 

159 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

161 AAAGTTCCAT CTCTAGAACT GATTTTTATC CGTTCTGTTT TTCAGGTCTT ATCTGTGTTA 60 
163 GTTGTGTGTT ACTATCAGGA GGCCCCCTTT GGACCCAGTG GATACAGATT ACGACTCTTC 120

165 TTTTATGGTG TATGCAATGT CATTTCTATC ACTTGTGCTT ATACATCATT TTCAATAGTT 180 

167 CCTCCCAGCA ATGGGACCAC TATGTGGAGA GCCACAACTA CAGTCTTCAG TGCCATTTTG 240

169 GCTTTTTTAC TCGTAGATGA GAAAATGGCT TATGTTGACA TGGCTACAGT TGTTTGCAGC 300 

171 ATCTTAGGTG TTTGTCTTGT CATGATCCCA AACATTGTTG ATGAAGACAA TTCTTTGTTA 360 

173 AATGCCTGGA AAGAAGCCTT TGGGTACACC ATGACTGTGA TGGCTGGACT GACCACTGCT 420 

175 CTCTCAATGA TAGTATACAG ATCCATCAAG GAGAAGATCA GCATGTGGAC TGCACTGTTT 480

177 ACTTTTGGTT GGACTGGGAC AATTTGGGGA ATATCTACTA TGTTTATTCT TCAAGAACCC 540

179 ATCATCCCAT TAGATGGAGA AACCTGGAGT TATCTCATTG CTATATGTGT CTGTTCTACT 600 

181 GCAGCATTCT TAGGAGTTTA TTATGCCTTG GACAAATTCC ATCCAGCTTT GGTTAGCACA 660 

183 GTACAACATT TGGAGATTGT GGTAGCTATG GTCTTGCAGC TTCTCGTGCT GCACATATTT 720

185 CCTAGCATCT ATGATGTTTT TGGAGGGGTA ATCATTATGA TTAGTGTTTT TGTCCTTGCT 780 

187 GGCTATAAAC TTTACTGGAG GAATTTAAGA AGGCAGGACT ACCAGGAAAT ATTAGACTCT 840

189 CCCATTAAAT GAATACCTGA TTATTATTGT CTCATTAATG TTCAGTTATT AATATGTATA 900

191 CTGCCATTTT AATGTTTACC TATGAATGTC TTTTGTGTTA TATAACTGAC AGAGTGCTAT 960 

193 AAAATATATA ATATATACAA ATGCAGAAAA TTTATTCTAG TCTAATATAT TCAAATACAA 1020 

195 ATATTAAATA TATGAAATAC GTTAAAAAAA AAAAAAAAAA AAA 1063 

198 (2) INFORMATION FOR SEQ ID NO: 4: 

200 (i) SEQUENCE CHARACTERISTICS: 

201 (A) LENGTH: 216 amino acids 

202 (B) TYPE: amino acid 

203 (C) STRANDEDNESS: 

204 (D) TOPOLOGY: linear 
206 (ii) MOLECULE TYPE: protein 

209 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



211 Met Trp Arg Ala Thr Thr Thr Val Phe Ser Ala Ile Leu Ala Phe Leu

212 1 5 10 15

214 Leu Val Asp Glu Lys Met Ala Tyr Val Asp Met Ala Thr Val Val Cys

215 20 25 30

216 Ser Ile Leu Gly Val Cys Leu Val Met Ile Pro Asn Ile Val Asp Glu

217 35 40 45

219 Asp Asn Ser Leu Leu Asn Ala Trp Lys Glu Ala Phe Gly Tyr Thr Met

220 50 55 60

222 Thr Val Met Ala Gly Leu Thr Thr Ala Leu Ser Met Ile Val Tyr Arg

223 65 70 75 80

225 Ser Ile Lys Glu Lys Ile Ser Met Trp Thr Ala Leu Phe Thr Phe Gly

226 85 90 95

228 Trp Thr Gly Thr Ile Trp Gly Ile Ser Thr Met Phe Ile Leu Gln Glu

229 100 105 110

231 Pro Ile Ile Pro Leu Asp Gly Glu Thr Trp Ser Tyr Leu Ile Ala Ile

232 115 120 125

234 Cys Val Cys Ser Thr Ala Ala Phe Leu Gly Val Tyr Tyr Ala Leu Asp

235 130 135 140

237 Lys Phe His Pro Ala Leu Val Ser Thr Val Gln His Leu Glu Ile Val

238 145 150 155 160

240 Val Ala Met Val Leu Gln Leu Leu Val Leu His Ile Phe Pro Ser Ile

241 165 170 175

243 Tyr Asp Val Phe Gly Gly Val Ile Ile Met Ile Ser Val Phe Val Leu

244 180 185 190

246 Ala Gly Tyr Lys Leu Tyr Trp Arg Asn Leu Arg Arg Gln Asp Tyr Gln

247 195 200 205

249 Glu Ile Leu Asp Ser Pro Ile Lys

250 210 215
254 (2) INFORMATION FOR SEQ ID NO: 5: 

256 (i) SEQUENCE CHARACTERISTICS: 

257 (A) LENGTH: 356 base pairs 

258 (B) TYPE: nucleic acid 

259 (C) STRANDEDNESS: double 

260 (D) TOPOLOGY: linear 
262 (ii) MOLECULE TYPE: cDNA 

265 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



267 TGGCCAAAGA GGCCTAGCCG GGAGCGGGCG AGGCGGCGGC GGCAGCAGCG ATGGCAGGAA 60 
269 TAGAGTTGGA GCGGTGCCAG CAGCAGGCGA ACGAGGTGAC GGAAATTATG CGTAACAACT 120 
271 TCGGCAAGGT CCTGGAGCGT GGTGTGAAGC TGGCCGAACT GCAGCAGCGT TCAGACCAAC 180 
273 TCCTGGATAT GAGCTCAACC TTCAACAAGA CTACACAGAA CCTGGCCCAG AAGAAGTGCT 240

275 GGGAGAACAT CCGTTACCGG ATCTGCGTGG GGCTGGTGGT GGTTGGTGTC CTGCTCATCA 300
277 TCCTGATTGT GCTGCTGGTC GTCTTTCTCC CTCAGAGCAG TGACAGCAGT AGTGCC 356
280 (2) INFORMATION FOR SEQ ID NO: 6: 

282 (i) SEQUENCE CHARACTERISTICS: 

283 (A) LENGTH: 102 amino acids 

284 (B) TYPE: amino acid 

285 (C) STRANDEDNESS: 

286 (D) TOPOLOGY: linear 
288 (ii) MOLECULE TYPE: protein 

291 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



293 Met Ala Gly Ile Glu Leu Glu Arg Cys Gln Gln Gln Ala Asn Glu Val

294 1 5 10 15

296 Thr Glu Ile Met Arg Asn Asn Phe Gly Lys Val Leu Glu Arg Gly Val

297 20 25 30

299 Lys Leu Ala Glu Leu Gln Gln Arg Ser Asp Gln Leu Leu Asp Met Ser

300 35 40 45

302 Ser Thr Phe Asn Lys Thr Thr Gln Asn Leu Ala Gln Lys Lys Cys Trp

303 50 55 60

305 Glu Asn Ile Arg Tyr Arg Ile Cys Val Gly Leu Val Val Val Gly Val

306 65 70 75 80

308 Leu Leu Ile Ile Leu Ile Val Leu Leu Val Val Phe Leu Pro Gln Ser

309 85 90 95

311 Ser Asp Ser Ser Ser Ala

312 100

315 (2) INFORMATION FOR SEQ ID NO: 7: 

317 (i) SEQUENCE CHARACTERISTICS: 

318 (A) LENGTH: 92 base pairs 

319 (B) TYPE: nucleic acid 

320 (C) STRANDEDNESS: double 

321 (D) TOPOLOGY: linear 
323 (ii) MOLECULE TYPE: cDNA 

325 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

327 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 60 
329 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 92
332 (2) INFORMATION FOR SEQ ID NO: 8: 
334 (i) SEQUENCE CHARACTERISTICS: 



335 (A) LENGTH: 1131 base pairs 

336 (B) TYPE: nucleic acid 

337 (C) STRANDEDNESS: double

338 (D) TOPOLOGY: linear 



340 (ii) MOLECULE TYPE: cDNA

343 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

345 GGGCCTCAAC TTTGGCGTCG TGAGATTCTT GTGAGGCGTC TGCCTGGAAG CCGGCAGCAA 60

347 TTTTGCTTCT TTAAAGAGAA AAAGAAGGCT AGGGACTCAG ATTCCTGGAT TCTGAGATCC 120

349 AGACCAGCTC CTCCCAGACC TCTCCAGAAG AAGCCATGGG AACCCCTCGT ATCCAGCATT 180

351 TGCTGATCCT CCTGGTCCTA GGAGCCTCCC TCCTGACCTC GGGCCTAGAG CTGTATTGTC 240

353 AAAAGGGTCT GTCCATGACT GTGGAAGCAG ATCCAGCCAA TATGTTTAAC TGGACCACAG 300 

355 AGGAAGTGGA GACTTGTGAC AAAGGGGCAC TTTGCCAGGA AACCATACTA ATAATTAAAG 360 

357 CAGGGACTGA GACAGCCATT TTGGCCACGA AGGGCTGCAT CCCGGAAGGG GAGGAGGCCA 420 

359 TAACAATTGT CCAGCACTCT TCACCTCCCG GCCTGATCGT GACCTCCTAC AGTAACTACT 480

361 GTGAGGATTC CTTCTGTAAT GACAAAGACA GCCTGTCTCA GTTTTGGGAG TTCAGTGAGA 540

363 CCACAGCTTC CACTGTGTCA ACAACCCTCC ATTGTCCAAC CTGTGTGGCT TTGGGGACCT 600 

365 GTTTCAGTGC TCCTTCTCTT CCCTGTCCCA ATGGTACAAC TCGATGCTAT CAAGGAAAAC 660 

367 TTGAGATCAC TGGAGGTGGC ATTGAGTCGT CTGTGGAGGT CAAAGGCTGT ACAGCCATGA 720 

369 TTGGCTGCAG GCTGATGTCT GGAATCTTAG CAGTAGGACC CATGTTTGTG AGGGAAGCGT 780 

371 GCCCACATCA GCTGCTCACT CAACCTCGAA AGACTGAAAA TGGGGCCACC TGTCTTCCCA 840

373 TTCCTGTTTG GGGGTTACAG CTACTGCTGC CATTGCTGCT GCCATCATTT ATTCACTTTT 900 

375 CCTAAGAAGG CACTTCTGGG CCTGGGTCTG AGGACATCTT TTTTGACTGG GAGCCTTCTT 960 

377 ACTGTTGAGG TTCAACAAGC TGAGGAGTAG ATGGGAATTT GAGGGAGAAT ACAGAGATAC 1020 

379 TATGAACGTA TTTGACATTT TTAATACAAT TTCTGCTATA ATTTTTGTAT GCAGTAGGCG 1080

381 TTACTAATAA ACATTTCTGC TGTGAAAAAA AAAAAAAAAA AAAAAAAAAA A 1131 
385 (2) INFORMATION FOR SEQ ID NO: 9: 



387 (i) SEQUENCE CHARACTERISTICS: 

388 (A) LENGTH: 249 amino acids 

389 (B) TYPE: amino acid 

390 (C) STRANDEDNESS: 

391 (D) TOPOLOGY: linear 
393 (ii) MOLECULE TYPE: protein 

395 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

397 Met Gly Thr Pro Arg Ile Gln His Leu Leu Ile Leu Leu Val Leu Gly

398 1 5 10 15

400 Ala Ser Leu Leu Thr Ser Gly Leu Glu Leu Tyr Cys Gln Lys Gly Leu

401 20 25 30

403 Ser Met Thr Val Glu Ala Asp Pro Ala Asn Met Phe Asn Trp Thr Thr
404 35 40 45

406 Glu Glu Val Glu Thr Cys Asp Lys Gly Ala Leu Cys Gln Glu Thr Ile

407 50 55 60

409 Leu Ile Ile Lys Ala Gly Thr Glu Thr Ala Ile Leu Ala Thr Lys Gly

410 65 70 75 80

412 Cys Ile Pro Glu Gly Glu Glu Ala Ile Thr Ile Val Gln His Ser Ser

413 85 90 95

415 Pro Pro Gly Leu Ile Val Thr Ser Tyr Ser Asn Tyr Cys Glu Asp Ser
L:1073 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:17

L:1076 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:17

L:2370 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:42

L:2411 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:42

L:2701 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:48

L:2793 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:50

L:3168 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:63

L:4507 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:87

L:4524 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:87

L:4900 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:100

L:5605 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108

L:5608 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108

L:5632 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108

L:5638 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108

L:6050 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:115

L:6154 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:115

L:6157 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:115

L:7303 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:140

L:7312 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:140

L:10459 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:191

L:10540 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:191

L:10835 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:197

L:11742 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207

L:11745 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207

L:11754 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207

L:11766 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207
Does Not Comply
Corrected

SEQUENCE LISTING

4 (1) GENERAL INFORMATION:

6 (i) APPLICANT: Jacobs, Kenneth

7 McCoy, John M.

8 LaVallie, Edward R.

9 Collins-Racie, Lisa A. 

10 Evans, Cheryl 

11 Merberg, David 

12 Treacy, Maurice 

13 Spaulding, Vikki 

15 (ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 

16 ENCODING THEM 
18 (iii) NUMBER OF SEQUENCES: 219 

20 (iv) CORRESPONDENCE ADDRESS: 

21 (A) ADDRESSEE: Genetics Institute, Inc. 

22 (B) STREET: 87 CambridgePark Drive 

23 (C) CITY: Cambridge 

24 (D) STATE: MA 

25 (E) COUNTRY: U.S.A. 

26 (F) ZIP: 02140

28 (v) COMPUTER READABLE FORM: 

29 (A) MEDIUM TYPE: Floppy disk 

30 (B) COMPUTER: IBM PC compatible 

31 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

32 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30
34 (vi) CURRENT APPLICATION DATA:

C--> 35 (A) APPLICATION NUMBER: US/09/745,763

C--> 36 (B) FILING DATE: 18-Jun-2000 

37 (C) CLASSIFICATION: 

39 (viii) ATTORNEY/AGENT INFORMATION: 

40 (A) NAME: Sprunger, Suzanne A. 

41 (B) REGISTRATION NUMBER: 41,323 

43 (ix) TELECOMMUNICATION INFORMATION: 

44 (A) TELEPHONE: (617) 498-8284 

45 (B) TELEFAX: (617) 876-5851 



ERRORED SEQUENCES 

1667 (2) INFORMATION FOR SEQ ID NO: 33: 

1669 (i) SEQUENCE CHARACTERISTICS: 

1670 (A) LENGTH: 2199 base pairs 

1671 (B) TYPE: nucleic acid 

1672 (C) STRANDEDNESS: double

1673 (D) TOPOLOGY: linear 



1675 (ii) MOLECULE TYPE: cDNA

1677 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:
RAW SEQUENCE LISTING DATE: 06/28/2001

PATENT APPLICATION: US/09/745,763 TIME: 14:39:43 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\06282001\I745763.raw 

1679 AGTTGGCAGG TGGAGAGGCA GGTTGGGAGG GAAAGTCGGG GGAGGACGCG GAAGAGGAGC 60 

1681 TGTGGGAAGG GGGAGGAGGG AGGGAGGAAA AGAGGAGGAG GCGGAGGAGA ACTGAGCAGA 120 

1683 GCAGAGCATC GAGCCAAAGG GGAGATGAGT TTGTCTGTCC TCTGCTGAGG CTACGGCCGG 180 

1685 GCCTAGGGAA CTGGGAGCTT GGGTGGAAGC GACACCCGTG GAAGTGGGAG GAGGTGGCGC 240 

1687 CGGGACTTTA ACCCCTTGTG GGCTCTGCGG CAGGGGATTT AACCCTTTGT GGATCTGGCC 300 

1689 CCTCGGAGGC AGCGTCATCG GTAGTTTTAA CCCCTTCGGG GCTGGGTTTC ACGCACTGGA 360 

1691 CTTACCCTCA TCACCTTGCT CACCAACTCC TTTATTGGGG TGCTCCGCTT GGAGGTTTGA 420 

1693 GGCCCACCTC CGCCCATTAC GTACTGTTCC TGCCGCTGCA CCCCCTTGGA CCCGCTAGCT 480 

1695 GGCCGCACTG TGGGCGCTTA ACCCTTTACT GACTTGAGCT CCCCAGATTG CAGTTGGAGT 540
1697 TTGCTGATAG AAGGACTAGC TAAAGGCGTC ACTGCAGGAA TTACAAACTG AAGAGGACTC 600 
1699 TGTTGGACTG TTTTTTTTTT CTTTTTCTTT TTTTTAAGAA AAACCCATTT TTTTCCTTAA 660 
1701 GGACTTACTA GCCAAAATTT CTTAAACTTC GAGGACTCTA CTAGCCATGG CCGAGCCATT 720 
1703 CTTGTCAGAA TATCAACACC AGCCTCAAAC TAGCAACTGT ACAGGTGCTG CTGCTGTCCA 780 
1705 GGAAGAGCTG AACCCTGAGC GCCCCCCAGG CGCGGAGGAG CGGGTGCCCG AGGAGGACAG 840
1707 TAGGTGGCAA TCGAGAGCGT TCCCCCAGTT GGGTGGCCGT CCGGGGCCGG AGGGGGAAGG 900 
1709 GAGCCTGGAA TCCCAACCAC CTCCCTTGCA GACCCAGGCC TGTCCAGAAT CTAGCTGCCT 960 
1711 GAGAGAGGGC GAGAAGGGCC AGAATGGGGA CGACTCGTCC GCTGGCGGCG ACTTCCCGCC 1020 
1713 GCCGGCAGAA GTGGAACCGA CGCCCGAGGC CGAGCTGCTC GCCCAGCCTT GTCATGACTC 1080 
1715 CGAGGCCAGT AAGTTGGGGG CTCCTGCCGC AGGGGGCGAA GAGGAGTGGG GACAGCAGCA 1140
1717 GAGACAGCTG GGGAAGAAAA AACATAGGAG ACGCCCGTCC AAGAAGAAGC GGCATTGGAA 1200 
1719 ACCGTACTAC AAGCTGACCT GGGAAGAGAA GAAAAAGTTC GACGAGAAAC AGAGCCTTCG 1260 
1721 AGCTTCAAGG ATCCGAGCCG AGATGTTCGC CAAGGGCCAG CCGGTCGCGC CCTATAACAC 1320 
1723 CACGCAGTTC CTCATGGATG ATCACGACCA GGAGGAGCCG GATCTCAAAA CCGGCCTGTA 1380 
1725 CTCCAAGCGG GCCGCCGCCA AATCCGACGA CACCAGCGAT GACGACTTCA TGGAAGAAGG 1440
1727 GGGTGAGGAG GATGGGGGCA GCGATGGGAT GGGAGGGGAC GGCAGCGAGT TTCTGCAGCG 1500 
1729 GGACTTCTCG GAGACGTACG AGCGGTACCA CACGGAGAGC CTGCAGAACA TGAGCAAGCA 1560 

1731 GGAGCTCATC AAGGAGTACC TGGAACTGGA GAAGTGCCTC TCGCGCATGG AGGACGAGAA 1620
1733 CAACCGGCTG CGGCTGGAGA GCAAGCGGCT GGGTGGCGAC GACGCGCGTG TGCGGGAGCT 1680 
1735 GGAGCTGGAG CTGGACCGGC TGCGCGCCGA GAACCTCCAG CTGCTGACCG AGAACGAACT 1740
1737 GCACCGGCAG CAGGAGCGAG CGCCGCTTTC CAAGTTTGGA GACTAGACTG AAACTTTTTT 1800
1739 GGGGGAGGGG GCAAAGGGGA CTTTTTACAG TGATGGAATG TAACATTATA TACATGTGTA 1860 
1741 TATAAGACAG TGGACCTTTT TATGACACAT AATCAGAAGA GAAATCCCCC TGGCTTTGGT 1920 
1743 TGGTTTCGTA AATTTAGCTA TATGTAGCTT GCGTGCTTTC TCCTGTTCTT TTAATTATGT 1980 
1745 GAAACTGAAG AGTTGCTTTT CTTGTTTTCC TTTTTAGAAG TTTTTTTCCT TAATGTGAAA 2040 
1747 GTAATTTGAC CAAGTTATAA TGCATTTTTG TTTTTAACAA ATCCCCTCCT TAAACGGAGC 2100 
1749 TATAAGGTGG CCAAATCTGA GAACAATTAA ATTCATTTTA GTTATAATAA ATTTAATATT 2160 
1751 TGTAAATGTA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 2199 
12067 (2) INFORMATION FOR SEQ ID NO: 219: 



12069 (i) SEQUENCE CHARACTERISTICS:

12070 (A) LENGTH: 542 amino acids

12071 (B) TYPE: amino acid

12072 (C) STRANDEDNESS:

12073 (D) TOPOLOGY: linear
12075 (ii) MOLECULE TYPE: protein

12077 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

12079 Cys Gly His His Glu Leu Asn Asn Leu Asn Leu Thr Gln Val Gln

12080 1 5 10 15

12082 Arg Asn Leu Ile Thr Asn Leu Gln Arg Ser Val Asp Asp Thr Ser

12083 20 25 30
12085 Ala Ile Gln Arg Ile Lys Asn Asp Phe Gln Asn Leu Gln Gln Val Phe

12086 35 40 45

12088 Leu Gln Ala Lys Lys Asp Thr Asp Trp Leu Lys Glu Lys Val Gln Ser

12089 50 55 60

12091 Leu Gln Thr Leu Ala Ala Asn Asn Ser Ala Leu Ala Lys Ala Asn Asn

12092 65 70 75 80

12094 Asp Thr Leu Glu Asp Met Asn Ser Gln Leu Asn Ser Phe Thr Gly Gln

12095 85 90 95

12097 Met Glu Asn Ile Thr Thr Ile Ser Gln Ala Asn Glu Gln Asn Leu Lys

12098 100 105 110

12100 Asp Leu Gln Asp Leu His Lys Asp Ala Glu Asn Arg Thr Ala Ile Lys

12101 115 120 125

12103 Phe Asn Gln Leu Glu Glu Arg Phe Gln Leu Phe Glu Thr Asp Ile Val

12104 130 135 140

12106 Asn Ile Ile Ser Asn Ile Ser Tyr Thr Ala His His Leu Arg Thr Leu

12107 145 150 155 160

12109 Thr Ser Asn Leu Asn Glu Val Arg Thr Thr Cys Thr Asp Thr Leu Thr

12110 165 170 175

12112 Lys His Thr Asp Asp Leu Thr Ser Leu Asn Asn Thr Leu Ala Asn Ile

12113 180 185 190

12115 Arg Leu Asp Ser Val Ser Leu Arg Met Gln Gln Asp Leu Met Arg Ser

12116 195 200 205

12118 Arg Leu Asp Thr Glu Val Ala Asn Leu Ser Val Ile Met Glu Glu Met

12119 210 215 220

12121 Lys Leu Val Asp Ser Lys His Gly Gln Leu Ile Lys Asn Phe Thr Ile

12122 225 230 235 240

12124 Leu Gln Gly Pro Pro Gly Pro Arg Gly Pro Arg Gly Asp Arg Gly Ser

12125 245 250 255

12127 Gln Gly Pro Pro Gly Pro Thr Gly Asn Lys Gly Gln Lys Gly Glu Lys

12128 260 265 270

12130 Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Glu Arg Gly Pro Ile Gly

12131 275 280 285

12133 Pro Ala Gly Pro Pro Gly Glu Arg Gly Gly Lys Gly Ser Lys Gly Ser

12134 290 295 300

12136 Gln Gly Pro Lys Gly Ser Arg Gly Ser Pro Gly Lys Pro Gly Pro Gln

12137 305 310 315 320

12139 Gly Pro Ser Gly Asp Pro Gly Pro Pro Gly Pro Pro Gly Lys Glu Gly

12140 325 330 335

12142 Leu Pro Gly Pro Gln Gly Pro Pro Gly Phe Gln Gly Leu Gln Gly Thr

12143 340 345 350

12145 Val Gly Glu Pro Gly Val Pro Gly Pro Arg Gly Leu Pro Gly Leu Pro

12146 355 360 365

12148 Gly Val Pro Gly Met Pro Gly Pro Lys Gly Pro Pro Gly Pro Pro Gly

12149 370 375 380

12151 Pro Ser Gly Ala Val Val Pro Leu Ala Leu Gln Asn Glu Pro Thr Pro

12152 385 390 395 400

12154 Ala Pro Glu Asp Asn Ser Cys Pro Pro His Trp Lys Asn Phe Thr Asp

12155 405 410 415

12157 Lys Cys Tyr Tyr Phe Ser Val Glu Lys Glu Ile Phe Glu Asp Ala Lys
12158 420 425 430

12160 Leu Phe Cys Glu Asp Lys Ser Ser His Leu Val Phe Ile Asn Thr Arg

12161 435 440 445

12163 Glu Glu Gln Gln Trp Ile Lys Lys Gln Met Val Gly Arg Glu Ser His

12164 450 455 460

12166 Trp Ile Gly Leu Thr Asp Ser Glu Arg Glu Asn Glu Trp Lys Trp Leu

12167 465 470 475 480

12169 Asp Gly Thr Ser Pro Asp Tyr Lys Asn Trp Lys Ala Gly Gln Pro Asp

12170 485 490 495

12172 Asn Trp Gly His Gly His Gly Pro Gly Glu Asp Cys Ala Gly Leu Ile

12173 500 505 510

12175 Tyr Ala Gly Gln Trp Asn Asp Phe Gln Cys Glu Asp Val Asn Asn Phe

12176 515 520 525

12178 Ile Cys Glu Lys Asp Arg Glu Thr Val Leu Ser Ser Ala Leu

12179 530 535 540

E--> 12180
VERIFICATION SUMMARY DATE: 06/28/2001

PATENT APPLICATION: US/09/745,763 TIME: 14:39:46

Input Set : A:\seqlist.txt

Output Set: N:\CRF3\06282001\I745763.raw

L:35 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:]
L:36 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]
L:1073 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:17
L:1076 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:17
L:1677 M:220 C: Keyword misspelled or invalid format, [(xi) SEQUENCE DESCRIPTION: SEQ ID NO:]
L:0 M:200 E: Mandatory Header Field missing, SeqNo=33, SEQUENCE DESCRIPTION: SEQ ID NO: of (2)
L:1677 M:202 (16) Value must be an Integer, Data=[]
L:2370 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:42
L:2411 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:42
L:2701 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:48
L:2793 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:50
L:3168 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:63
L:4507 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:87
L:4524 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:87
L:4900 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:100
L:5605 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108
L:5608 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108
L:5632 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108
L:5638 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:108
L:6050 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:115
L:6154 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:115
L:6157 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:115
L:7303 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:140
L:7312 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:140
L:10459 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:191
L:10540 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:191
L:10835 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:197
L:11742 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207
L:11745 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207
L:11754 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207
L:11766 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:207
L:12180 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 219